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Abstract 



The Erdos-Renyi process begins with an empty graph on n vertices, 
with edges added randomly one at a time to the graph. A classical re- 
sult of Erdds and Renyi states that the Erdos-Renyi process undergoes 
a phase transition, which takes place when the number of edges reaches 
n/2 (we say at time 1) and a giant component emerges. Since this sem- 
inal work of Erdds and Renyi, various random graph models have been 
introduced and studied. In this paper we study the Bohman-Frieze 
process, a simple modification of the Erdos-Renyi process. 

The Bohman-Frieze process also begins with an empty graph on n 
vertices. At each step two random edges are presented, and if the first 
U: edge would join two isolated vertices, it is added to a graph; otherwise 

r~| ' the second edge is added. We present several new results on the phase 

transition of the Bohman-Frieze process. We show that it has a qual- 
itatively similar phase transition to the Erdos-Renyi process in terms 
of the size and structure of the components near the critical point. We 
prove that all components at time t c — e (that is, when the number 
C*~) , of edges are (t c — e)"/2) are trees or unicyclic components and that 

^f" the largest component is of size i7(e~ 2 logn). Further, at t c + e, all 

components apart from the giant component are trees or unicyclic and 
the size of the second-largest component is 0(e -2 logn). Each of these 
f^S , results corresponds to an analogous well-known result for the Erdos- 

Renyi process. Our proof techniques include combinatorial arguments, 
, the differential equation method for random processes, and the singu- 

larity analysis of the moment generating function for the susceptibility, 
which satisfies a quasi-linear partial differential equation. 



X ■ 1 Introduction 

5-H 



Erdos and Renyi began the study of random graphs in 1959 with their 
landmark paper 'On the Evolution of Random Graphs' [33]. They described 
the changes in the structure of a graph as it evolves from empty (no edges) 
to full (all edges present), with a uniformly chosen random edge added at 
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each step. There are three closely related models of the Erdos-Renyi random 
graph: (i) the binomial random graph G(n,p), the graph on n vertices in 
which each edge is present independently with probability p; (ii) the uniform 
random graph G(n,m), which is a graph chosen uniformly from all graphs 
of n vertices and m edges; (hi) the Erdos-Renyi random graph process, 
which starts with an empty graph on n vertices and at each step a random 
edge selected uniformly and independently from the set of potential edges is 
added to the graph. The models are related as follows: G(n,p) and G(n, m) 
are essential equivalent for the correct choice of m and p, and the mth step 
of the random graph process has the distribution of G(n,m). 

The most striking result from [13] was the discovery of a 'double jump' 
in the largest component size at ^ edges. With ^ edges, c < 1 a constant, 
all connected components of the random graph are of size O(logn) with 
probability 1 — o(l). With ^ edges, the largest component is of size 9(n 2 / 3 ). 
And with ^ edges, c > 1, there is a 'giant component' of size 0(n) and 
all other components are of size O(logn). This rapid transformation of the 
graph is called the 'phase transition' of the random graph and is related to 
similar phase transitions in percolation models in mathematical physics. 

The phase transition of the Erdos-Renyi random graph is now under- 
stood in great detail. For a comprehensive account of the results see [I], 
[12j . |20| . In particular, the correct scaling for the critical window is known, 
as well as the distribution of the size and structure of components at each 
period of the phase transition. 

Motivated by the successes of studying the Erdos-Renyi random graph, 
researchers have studied many different modifications of the random graph 
process. These modifications range from the study of random graphs with 
a given degree sequence [23], [29J, [30], to random planar graphs [22], [28] . 
to random simplicial complices [26]. 

One particular modification proposed by Achlioptas was to use the 'power 
of two choices' to create processes with different behavior. In an Achlioptas 
process, two randomly sampled potential edges are presented at each step, 
then one of them is chosen according to a given rule and added to the graph. 
The choice of edge can depend on which edges were added to the graph in 
previous steps. Each different rule corresponds to a different random graph 
process. The Erdos-Renyi process itself comes from the rule 'always add 
the first edge' (or the rule 'always add the second edge'). Achlioptas asked 
whether there was a rule that pushed back the emergence of a giant compo- 
nent. In [6], Bohman and Frieze answered his question in the affirmative by 
analyzing the following rule: add the first edge if it would join two vertices 
that did not intersect any of the previously presented edges; otherwise add 
the second edge. Their work showed that the choice of two edges at each 
step of the process could indeed cause different behavior, and initiated two 
major threads of research into Achlioptas processes. 

The first is concerned with testing the power and limits of Achlioptas 
processes. How much can we accelerate or delay the phase transition? How 
long can we delay the formation of a Hamiltonian cycle? These questions 
can be asked in the original context of an Achlioptas process or in the off-line 
case in which all pairs of edges are given first, then the choices are made. The 
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results often generalize to the case in which k edges are presented at each 
step instead of 2. Recent results concern (i) the acceleration or delaying of 
the phase transition [3] , [6] , [T] , [9] , [18] ; (ii) the avoidance of small subgraphs 
|24j . [31] . [32] ; (hi) the acceleration of the appearance of Hamiltonian cycles 

The second thread of research involves choosing one hxed Achlioptas 
rule and exploring the fine details of its evolution. We will focus on this 
second thread of research in this paper, and in particular concentrate on the 
Bohman- Frieze process, a process very similar to the process in [6]. The rule 
for what is now known as the Bohman- Frieze process is as follows: if the 
first edge would join two isolated vertices, add it; otherwise add the second 
edge. The Bohman-Frieze process is a simple modification of the Erdos- 
Renyi process with dependence between the edges, and different behavior: 
the emergence of the giant component is in fact delayed. The motivation 
for studying its finer behavior is to develop methods for analyzing random 
graph processes with dependence between the edges and to understand how 
universal the Erdos-Renyi phase transition is. The Bohman-Frieze process is 
also shorthand for a much wider class of Achlioptas rules, so-called 'bounded- 
size rules' introduced in [33]. In a bounded-size rule, the choice between the 
two edges can only depend on the sizes of the four components that may be 
connected, and all components of size > K must be treated the same, for 
some fixed constant K. Most of the results on the phase transition of the 
Bohman-Frieze process, in |21j.|34j. and in this work, can be transferred in 
a straightforward way to all bounded-size rules. 

While the phase transition of the Bohman-Frieze process is delayed, it 
shares many qualitative similarities to that of the Erdos-Renyi process. In 
this paper we strengthen these qualitative similarities, presenting results on 
the size and structure of the largest and second-largest components near 
the critical point. We also prove an asymptotic formula for the fraction 
of vertices in components of a given size. As in previous analysis of the 
Bohman-Frieze process, we use the differential equation method for random 
processes. But here, using the ordinary generating functions arising from the 
Bohman-Frieze process, we derive a quasi-linear partial differential equation 
that tracks key statistics of the process. Then we apply the singularity im- 
plicit functions theorem and the transfer theorem from the work of Flajolet 
and Odlyzko [16J and Flajolet and Sedgewick [T7] . 

Before we present our main results, we introduce notation and concepts 
that will be used throughout the paper. 

2 Preliminaries 

Throughout the paper we consider the following version of the Bohman- 
Frieze process: at each step, we select two edges uniformly at random (with 
replacement) from the set of edges not yet present in the graph. If the first 
edge would join two isolated vertices, we add it. Otherwise, we add the 
second edge. 
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Notation. We write 'with high probability' or 'whp' if an event holds 
with probability — > 1 as n — > oo (other authors sometime use 'aas' or 
'asymptotically almost surely'). We use the standard asymptotic notation 
O(-), o(-), O(-), f2(-), and w(-). We often combine the asymptotic notation 
with a statement about probability: whp g n = 0(f(n)) means that there 
exists a constant K so that g n < Kf(n) whp; and whp g n = o(/(n)) means 
that for every constant 5 > 0, g n < 5f(n) whp. We sometimes use asymp- 
totic notation as n — > oo and e — > 0, where e is independent of n: whp 
g{n,e) = 0(/(n,e)) means that there exist constants K and eo, such that 
for all < e < eo, g(n, e) < Kf(n, e) with probability — > 1 as n — > oo, where 
the convergence need not be uniform in e. 

For i,j E N, Cj will refer to the ith largest connected component of a 
graph. X\{j) will denote the number of isolated vertices at step j of the 
process, and similarly Xi(j) will be the number of vertices in components of 
size i at step j of the process. Our main results concern the size of structure 
of connected components in the random graphs. In terms of structure, a 
'simple' component is either a tree or unicyclic (if a component is simple 
and has k vertices, then it has k — 1 or k edges and is a tree or unicyclic 
component respectively). A 'complex' component is a connected component 
with two or more cycles, or > k + 1 edges if it has k vertices. 

The Erdos-Renyi random graph undergoes a phase transition around 
step l|. So, we will parameterize time in a standard way 

t _ # of edges 

n/2 1 ' 

so t is the average degree in the graph. We write G(t) for a graph with 
edges in a random graph process. We denote the size of a component 
C by \C\, so |Ci(t)| is the size of the largest component of the random 
graph process at time t. The critical point t c for the phase transition is 
defined so that for all fixed t < t c , \Ci(t)\ = O(logn) and for all fixed 
t > t c , |Ci(t)| = G(n). For the Erdos-Renyi process, t c = 1, while for the 
Bohman- Frieze process t c is approximately 1.176 [34] . 

The Differential Equation Method. A key method used in studying 
random graph processes is the 'Differential Equation Method'. Wormald 
[35], [36j has proved several general theorems about the method. The 
method was used by Bohman [5j and by Bohman and Keevash [8] to analyze 
the triangle-free process and the H-free process respectively. It has previ- 
ously been used by Spencer and Wormald [Sj, Bohman and Kravitz |10j . 
and Janson and Spencer [21] in analyzing the phase transition of Achlioptas 
processes. 

The method is used to show that a random quantity, X{(j) for exam- 
ple, is tightly concentrated under the right scaling around a deterministic 
function throughout the course of a random graph process (using sub- and 
supermartingales). The deterministic function is the solution to a system of 
ordinary differential equations. Applying Wormald's theorem (e.g. Theorem 
5.1 in [35]) on the differential equation method, Spencer and Wormald [M] 
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show that in a bounded-size Achlioptas process, the proportion of vertices 
in components of size i, for fixed i, is concentrated around a deterministic 
function Xi(t). In Section [6.21 we study this system of ordinary differential 
equations for the specific case of the Bohman-Frieze process. 

Susceptibility. The susceptibility of a graph G is defined as the expected 
component size of a randomly chosen vertex: 

^( G ) = ~Ei^)i = ~Ei^i 2 > ( 2 ) 

V d 

where the second sum is over all connected components and K (v) denotes the 
component containing the vertex v. The name 'susceptibility' comes from 
the study of percolation on infinite lattices in mathematical physics (see, for 
example, [19J). There, the susceptibility is defined to be the expected size 
of the connected cluster containing the origin. 

We can also define higher moments of the susceptibility: 

S k (G) = W\ K ^)\ k = lY,\ C ^ +1 - ( 3 ) 

v C t 

Using the differential equation method, Spencer and Wormald [M] showed 
that for t < t c , whp S^Gj) = si(^) + o(l) for some deterministic function 
si(t) which blows-up at t = t c . Janson and Spencer [21] extend this proof to 
Sf- for any fixed k. The deterministic functions Sfc(i) are given as the solution 
to a system of ODE's. We write these ODE's for the Bohman-Frieze process 
and analyze them in Section [6.21 The blow-up point of each of s^'s is t = t c 
[33]. We will write S(j) for S\(Gj) and s(t) for s\(t) if we are considering 
the susceptibility apart from the higher moments. 

Singularity Analysis. Given an ordinary generating function f(z) := 
S«>o fi z ! > we sometimes use the notation [z 1 ] f(z) to mean /j. When /j > 
for all i > 0, by Pringsheim's theorem [17] , among the singularities of f(z) 
which are closest to the origin, there is one with a positive real value, which 
is called the dominant singularity p of f(z). In this paper we will deal with 
generating functions f(z) which have a singular expansion at p of the form 

f{z)=g{z)-h{z){l-z/pff 2 , 

for some functions g(z), h{z) analytic at p, and which are A-analytic, that is, 
they can be analytically continued to the so-called A-domain A = {z \ \z\ < 
p + e, z ^ p, | arg(z — p)\ > <f\ for some e > and < <j> < n/2. There- 
fore, applying the transfer theorem by Flajolet and Odlyzko [16] (see e.g. 
Theorem VI.3 [T7]): for a < f3 

[A ((1 - zip)* + Oil - z/pf) = [z*] (1 - zIpT + O ([*<] (1 - z/pf) 

and the basic scaling (Chapter 6, [T7]): 

[A (i - z/p) 1 / 2 = --L= r 3 / 2 p -* (i + o(i/i)), 
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we obtain asymptotics for the coefficient [z l ] f(z) of the form 

M/W = ^r 3 /V*(i + o(iA)). 
3 Main Results 

There is a strong sense that the phase transition of the Bohman- Frieze pro- 
cess is qualitatively the same as that of the Erdos-Renyi process: while 
certain constants involved may differ, the behavior is the same. In statisti- 
cal physics, a group of processes with the same critical exponents is said to 
belong to the same 'Universality Class'. In that spirit, much of the work on 
the Bohman-Frieze process has been aimed at investigating its phase tran- 
sition and proving results analogous to what we already know about the 
Erdos-Renyi phase transition. 

Our results in this paper are in the barely sub- and supercritical regimes, 
t c ± e (Theorems [TJ [2j and [3|). Asymptotics involving both n and e in 
Theorems [JJ and [5] are given by first fixing e, letting n — > oo, then letting 
e — > 0. Much is known for the Erdos-Renyi process where e = e(n) — > 0, 
but here we keep e fixed relative to n. A recent preprint [3] investigates 
component sizes inside the critical window. We mention previous work in 
each regime and the analogous Erdos-Renyi properties, then we give our 
new results. 

3.1 Critical Point 

Bohman and Frieze initiated the study of Achlioptas processes by exhibiting 
a process with t c > 1. Spencer and Wormald [34J and Bohman and Kravitz 
[TO] showed that the precise value of t c for any bounded-size Achlioptas 
processes (including the Bohman-Frieze process) can be expressed as the 
blow-up point of si(i), the deterministic function around which the sus- 
ceptibility concentrates. The corresponding function for the Erdos-Renyi 
process is s\(t) = j^, which blows up at t = 1. For the Bohman-Frieze 
process, t c can be computed numerically and is approximately 1.176. 

3.2 Growth of Early Giant 

In the Erdos-Renyi random process, the fraction of vertices in the largest 
components at time t is p(t)+o(l) whp, where p(t) is the survival probability 
of a Poisson Galton- Watson branching process with mean t. For t < 1, 
p(t) = and for t > 1, p is the non-zero solution to the following equation: 

1 - p = e~ tp . 

In the barely super-critical regime, this gives p(l + e) ~ 2e as e — > 0; i.e. the 
giant component grows linearly after the critical point with an initial rate 
of 2. 

For the Bohman-Frieze process, no formula is known for the size of the 
giant component at an arbitrary t, but in the barely super-critical regime, 
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Janson and Spencer showed [21] that there exist constants 7, K so that whp: 



7 e - Ke 4/3 < \d(t c + e)\/n < 7 e + Ke 4/3 . 



This roughly says that the giant component of the Bohman-Frieze process 
grows linearly after the critical point with an initial rate of 7. 7 can be 
computed numerically and is approximately 2.463. Their method involves 
approximating the survival probability of a multi-type Poisson branching 
process using the differential equation method and the first three moments 
of the susceptibility. 

3.3 Barely Subcritical Regime 

For the Erdos-Renyi process, in the barely subcritical regime, whp all com- 
ponents are of size O(logra) and are simple. Indeed, whp |Ci(t c — e)| = 
B(e~ 2 logn). The distribution of the number of cycles is also known (see 
[llj . [12j). In fact, Flajolet, Knuth and Pittel [TS] found the distribution 
of the length of the first cycle to appear in the Erdos-Renyi process. For 
the Bohman-Frieze process, Spencer and Wormald [34] showed that whp 
\Ci(t c — e)| = O(logn) with a constant in the O(-) that depends exponen- 
tially on e ; in Section l4T2l we will show that whp \C\{t c —e)\ = Cl(e~ 2 logn), 
and in Conjecture [1] we conjecture a matching upper bound. We also de- 
scribe the component structure at t c — e: 

Theorem 1. At t c — e, whp every component in the Bohman-Frieze process 
is simple. The expected number of unicyclic components is ~ \ log - as 
e — > 0, and the probability that there are no cycles is ~ y/e. 

3.4 Barely Supercritical Regime 

In the Erdos-Renyi process, there is a symmetry between the subcritical 
graph and the supercritical graph with the giant removed (see [27]). C2{t c + 
e) therefore is very close in distribution to C\{t c — e). In particular, whp 
C2{t c + e) is simple and | C2 (i c + e) | = 0(e~ 2 logn). While no corresponding 
symmetry theorem for the Bohman-Frieze process is known, we find the size 
and structure of the smaller components: 

Theorem 2. Whp | C 2 (i c + e)| = 0(e~ 2 logn). Moreover, whp all compo- 
nents apart from C\ are simple. 

3.5 Vertices in Small Components 

The differential equation method shows [31] that in a bounded-size Achliop- 
tas process, the proportion of vertices in components of size i, for i fixed 
relative to n, is whp concentrated around a deterministic function Xi{t): 





(4) 



n 
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For the Erdos-Renyi random process, there is a precise formula for Xi(t) 
(see Equation (|16|) ). which yields the following asymptotics as i — > oo: 

x,(t e ±e) = ^ r 3/2 e -|D( e) A (1 + (5) 
\/2tt 



Here C(e) = 1 + 0(e) and D(e) = 1 + O(e). 

In the Bohman-Frieze process, because of the dependence between edges, 
we cannot derive an exact formula similar to Equation (|16p. Instead, we 
derive an asymptotic formula for Xi(t) similar to ([5]). To this end, we use 
the generating function P(t,z) = ^2iXi(t)z l , which is related to the higher 

moments of the susceptibility by 9 = s k(t)- in Section IfOI we show 

° z z=l 

that Xi(t) is determined by the following system of ODE's: 

x[(t) = -xi(t) -x\{t) + x\{t) 

x' 2 (t) = 2x\{t) - xj(t) - 2(1 - xj(t))x 2 (t) 

x'S) = |(1 - x\(t)) x k {t) Xi - k {t) - i(l - xl(t))xi(t), i > 2 

k<i 

and therefore P(t, z) satisfies a non-homogeneous quasi-linear PDE: 
_ z{1 _ 4 mP (t, z) - l) 9 -^ = z(z - l)xl(t) 

with the initial condition P(0, z) = z. Its solution is implicitly defined by a 
single function. Applying the singular implicit functions theorem (see e.g. 
Lemma VII. 3. in [17j ) to that solution, we derive the following asymptotic 
formula for Xi(t): 

Theorem 3. Let Xi(t) be the solution of the above ODE's. Then 

Xi (t c ±e) = C(e)r 3 /V D ^ 2i (l + 0(l/i)), (6) 
where C{e) = c + O(e) and D(e) = d + O(e) for absolute constants c,d. 

Notice that Theorem [3] is a statement about the asymptotic behavior of 
the infinite sequence of deterministic functions {xi(t)}'?2 =1 (which does not 
involve n at all), not about the random graph process. In particular, we do 
not apply Theorem [3] for i = i(n). Instead we will apply it for i = 0(e~ 2 ) 
in proving Theorem [21 for e > a constant with respect to n. 

In the rest of the paper we prove our main results; in Section [U Theorem 
[TJ in Section [5j Theorem (2J and in Section [61 Theorem [3j 

4 Subcritical Regime: Proof of Theorem 1 
4.1 Complexity of Components 

Our results in the barely subcritical case state that, as in the Erdos-Renyi 
process, no complex connected component emerges in the Bohman-Frieze 
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before t c — e for any fixed e > 0. Thus the Bohman-Frieze rule not only 
delays the formation of a giant component but also the formation of bicyclic 
components. Our proof makes a connection between the rate at which uni- 
cyclic components are formed and the susceptibility of the graph. 

To prove Theorem [1] we first prove the following lemma concerning the 
number of cyclic components in the subcritical Bohman-Frieze process: 

Lemma 1. At time t c — e, the number of cyclic components in the Bohman- 
Frieze process is asymptotically Poisson with mean fi e , and fx e ~ \ 1°S as 
e->0. 

Proof. We first begin by fixing e > and conditioning on the concentration 
of the values of X\{j) and S{j). Spencer and Wormald [M] show that 
with error probability 0(exp(— n 1 ' 5 )) we have — 2 = x\(t) + o(l) and 
S(j) = s(t) + o(l) for all t<t c -e. 

At step j of the Bohman-Frieze process, conditioned on the event that X\ 
and S are concentrated near their deterministic counterparts, the probability 
that an edge is added within an existing component is 

fl " ^f) E = 1 ( 1 " xl(2j/n))s{2j/n) + o(n^). 

We run the process for ( fc ~ e ) n steps, and so the number of cyclic components 
created converges to a Poisson random variable (see Barbour, Hoist, Janson 
[2]) with mean 

(t c — e)n 

Me = ~ V (l-xl(2j/n))s(2j/n)+o(l), 
i=i 

where the o(l) is the error in the approximation of x\ and s. We can 
approximate /j, e by an integral as n — > oo, with dt ~ ^dj: 

^ = \ / <C \l-x 2 1 (t))s(t)dt + o(l). (7) 
1 Jt=o 

To compute this integral, we need an expression for s(t). From Spencer 
and Wormald |34] we have 

S '(t) = x 2 1 (t) + (l-x 2 1 (t))s 2 (t) 

with the initial condition s(0) = 1. In particular, this differential equation 
blows up at t = t c . As we are interested in the limit as e — > 0, we need the 
asymptotics of s(t c — e) as e — > 0. We compute: 

-)■ -(1 - x\(t c )) as t — > t c . 

So g^-e) ~ e (l — x i(*c)) and s(t c — e) ~ ^(l-i^t )) " ^ ow we integrate the 
right-hand side of equation ([7]) and find fx e ~ | log - as e — >• 0. 

□ 
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We now complete the proof of Theorem Q] and prove that whp there are 
no complex components at t c — e. At step j, the probability of creating a 
bicyclic component is 

V n z J n z 

Ci,Ck unicyclic 

The process at t c — e is subcritical so \Ci\ < K(e) logn for all i whp, where 
K (e) is a constant independent of n (see [34]). and from Lemma[IJ whp the 
number of unicyclic components is < logn (or any other function r(n) = 
cj(1)). The expected number of bicyclic components created before t c — e is 
therefore 

. n K 2 (e) log 4 n , . 
so whp there are no complex components at t c — e. 

4.2 Size of the Largest Subcritical Component 

Theorem U] provides the lower bound of f2(e~ 2 logn) for \C\(t c — e)|. 

Theorem 4. There exists an absolute constant K so that whp, 

\Ci{t c -e)\ >Ke- 2 \ogn. (8) 

Proof. We will use the so-called "sprinkle" from t c — 2e to t c — e; that is, we 
first run the Bohman-Frieze process to time t c — 2e, and then, conditioned 
on the resulting graph with certain properties that hold whp, we run the 
Bohman-Frieze process from that graph to time t c — e. 

We begin at time t c — 2e and call vertices in components of size between 
m\e~ 2 and m,2e~ 2 medium vertices, where mi < are constants which will 
be determined later. Call the subgraph of medium vertices Gm- We run the 
process from time t c — 2e to t c — e and show that a connected component of 
size > Ke~ 2 logn emerges whp in Gm, f° r some constant K. This suffices 
for a lower bound on \C\\ for the entire graph. We call each connected 
component in Gm at time t c — 2e a 'node'. Using Theorem we know that 
whp the number of nodes in Gm is 

for some constant c(mi, tn^)- Each pair of nodes has at least m 2 e~ 4 potential 
edges between them. So from t c — 2e to t c — e, since we add at least |jj random 
edges as the second in a pair, each pair of nodes is joined with probability 
at least m^e _3 n _1 /20. Now we couple Gm with an Erdos-Renyi random 
graph that has iV = c(mi, m,2)e 3 n nodes and the probability of joining two 
nodes is j? with d at least m 2 c(mi, ma)/20 a constant independent of n. 
From the original results of Erdds and Renyi [13] we know that whp a 
connected component of > K{d) log iV nodes forms in the graph. Since 
each node has > m\t~ 2 vertices, whp we have a connected component with 
> K(m\, m2)vn\€~ 2 logn vertices at time t c — e. 

□ 
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We conjecture a matching upper bouncQ: 
Conjecture 1. There exists a constant K u so that whp 

\Ci{t c -e)\ <K u e- 2 logn. (9) 

5 Supercritical Regime: Proof of Theorem 2 

First we shall prove the following lemmas in Sections 15.14 15, 2\ which corre- 
spond to the first part of Theorem [2] that \C2(t c + e)| = 0(e -2 logn). 

Lemma 2. There exists a constant K u so that whp |C2(t c +e)| < K u e~ 2 log n. 

Lemma 3. There exists a constant Ki so that whp \C2(t c +e)\ > Kie~ 2 logn. 

In Section 15.31 we shall prove the second part of Theorem [2] that all 
components apart from the giant component are simple. 

5.1 Upper bound of \C 2 (t c + e)| 

In order to prove Lemma [2] we use the "sprinkle" from t c + e/2 to t c + e. 
We first run the Bohman- Frieze process up to time t c + e/2 and call the 
resulting graph G. Janson and Spencer [2IJ show that whp G contains a 
giant component C\ of size > yen for some constant y. We let G \ C\ = 
Qsmaii _ ]y ex £ we run £ ne Bohman- Frieze process further up to time t c + e and 
want to show that there exists a constant K u so that whp each component 
of size > K u e~ 2 \ogn in G smal1 at time t c + e is connected to C\, so that 
|C2(^c+e)| < K u e~ 2 logn. To this end we let X be the number of components 
of size > K u e~ 2 logn in 

Qsmall at time ^ + e that 

are not connected to C\ 
and will show that Pr[X > 1] = o(l) by proving EX < o(l). 
Now we write 

EX = ^Pr[ J B v J-Pr[y i ^dl^yj, (10) 

Vi 

where the V^s are all possible sets of vertices in G smal1 at time t c + e of size 
larger than K u e~ 2 logn; By i is the event that V{ forms a connected, isolated 
component in G smal1 at time t c + e; and Vi C\ is the event that component 
Vi is not connected to the largest component at t c + e. Since there can be at 
most n isolated components in G smal1 at time i c + e and so ^2 V . Pr[SyJ < n, 
it suffices to prove that for any V C G smal1 with |V| > K u e~ 2 logn, 

Yr[V w Ci\B v ] < n~ 2 , (11) 

for this implies that KX < n" 1 as desired. 

In order to prove (jlip . we consider the following special pairs of edges 
that may appear at a step in the Bohman- Frieze process between t c + e/2 
and t c + e: call a pair (e, /) of edges red if the first edge e does not join two 

1 Subsequent to this work the authors of [4] have made substantial progress on this 
conjecture, proving an upper bound of 0(e~ 2 (logn) 4 ) on \Ci(t c — e)|. 
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vertices which are isolated at time t c + e/2 and the second edge / joins a 
vertex in C\ to a non- isolated vertex in G smal1 . If such a pair is presented 
at a step in the Bohman- Frieze process between t c + e/2 and t c + e, the 
second edge / will necessarily be added according to the Bohman-Frieze 
rule. Call such a second edge / a red edge. We note that the number of 
red edges added between t c + e/2 and t c + e is distributed as Bin(^pp), 
where p = (1 - xf(t c + e/2)) • £MlzHgihW*c±e/2» (1 + o(1 )). But for an 
upper bound we shall add each red edge independently with probability ^ 
with c chosen small enough so that whp the actual red edges added in the 
Bohman-Frieze process are a superset of these independent edges. Note that 
the choice of red edges does not affect the other edges in the Bohman-Frieze 
process since it does not affect which vertices are isolated. 

Now we return to (fTT]) and note that for any V C G smal1 with |V| > 
K u e~ 2 logn, 

Pr[V <~ d\B v ] < Pr[/x . . . f E G\B V ] (12) 

where Ji-.-Je are the edges between C\ and the vertices of V which were 
not isolated at t c + e/2. In order to bound Pr[/i ... fE ^ G\By] from above, 
we just consider the probability that each /j is added with probability — 
as an independent red edge (as discussed above). To bound the number of 
possible red edges, we color each vertex that was isolated at t c + e/2 orange 
and leave the vertices that were not isolated at time t c + e/2 uncolored, i.e. 
non-orange. We claim the following. 

Proposition 1. There exists a constant K u so that whp there is no con- 
nected component in G smaU at t c + e with > j^K u e~ 2 \ogn orange vertices 
and < j^K u e~ 2 logn non-orange nodes. In particular, whp every connected 
component in the G smal1 subgraph att c + e of size > K u e~ 2 logn has at least 
-^e _2 logn non-orange vertices. 

By Proposition [H there are at least yen ■ -f^e -2 logn = yn ■ -fjy-e -1 logn 
possible red edges that can join V to C\. So, we have 



cc 



yn - e 1 log n 



PT\f 1 ...f E $G\B v ]<(l--) <n~ 2 , (13) 

for K u chosen large enough. From (|12p and ()13|) . we have (jl If) as desired. 

Proof of Proposition^ For notational convenience we let A = K u e~ 2 \ogn. 
We pick an arbitrary vertex v G G smal1 and perform a breadth-first search 
(BFS) of its connected component at t c + e. This method is used to analyze 
the size of components in the Erdos-Renyi process very precisely; here we do 
not have independence between the edges in the Bohman-Frieze process, but 
for this proposition we just need coarse bounds on conditional probabilities. 

Let M > be a constant and let Y be a binomial random variable 
with success (which in the BFS means that a given vertex has an orange 
vertex as a child) probability | and M trials. As we will see below, it is 
useful to consider the probability Q = Pi[Y > I-M]. Using a Chernoff 
bound for a binomial (e.g. |20j), there exists a constant c > so that 
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Q = Pr[Y > §M] < exp(-cM). The latter can be made to be o(n 1 ) for 
suitably large K u . 

We claim that the probability that a particular child of a vertex in the 
BFS is orange is < | conditioned on the previous history of the BFS. As- 
suming the claim, the probability that the component starting from v has 
> -j^A orange vertices and < j^A non-orange vertices (as children in the 
BFS) is less than Q < o(n~ l ). Furthermore, there are n possible vertices 
from which to begin the BFS, so by the union bound the probability that 
a component with > j^A orange vertices has < j^A non-orange vertices is 
o(l). In other words, whp any connected component in G smal1 of size > A 
at t c + e has > j^A non-orange vertices, as desired. 

To prove the claim, we note that without conditioning on the BFS, given 
that we add an edge connected to a fixed isolated vertex v at time t, the 
probability that the second vertex is isolated is 

Xl (t) + (I - xKt^xtjt) 
Pl= s l( t) + (!-*?(*)) +0(1) ' (M) 

and given that we add an edge containing a fixed non-isolated vertex v at 
time t, the probability that the second vertex is isolated is 

p 2 =x 1 (t) + o(l). (15) 

The relevant history of the BFS are the edges incident on v. We are con- 
ditioning on edges present in the graph, and can break up the conditional 
probability into two cases: all the conditioned edges appear later in the 
process than the edge we consider, or at least one conditioned edge appears 
earlier. In the first case, the probability that the edge is added to an isolated 
vertex is at most p±, and in the second case the probability is pi- 

We can bound xi(t) by e~* since xi(0) = 1 and x'^t) = —x\(t) — x\{t) + 
x\ (i), and we can bound t c > 1 by the results of Bohman and Frieze [6] and 
Spencer and Wormald[34J. With these bounds, max.(p\,p2) < § and so the 
claim is proved. 

□ 



5.2 Lower bound of |C 2 (*c + e)| 

It will turn out to be useful to have the following concentration lemma of a 
sum of indicator random variables: 

Lemma 4. Let Y = Y%, where the Y% 's are indicator random variables, 
and both M and the Y{ 's can depend on an underlying parameter n. Let 
jj, = EY and assume fx — > 00 as n — > 00. Assume that for all i 7^ j, 
E\YjYj] < EYiEYj(l + f(n)), where f(n) -> 0. Then Y ~ /i whp. Similarly, 
ifE[Yi\Yj = 1] < EYj(l + f(n)) for all i / j, then Y ~ p whp. 
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Proof. Using Chebyshev's inequality it is enough to prove that j0tQ = o(l). 

var(y) EY + ^EY^-jEY) 2 

(EY) 2 ~ (EY) 2 

V EY-Y- 
= ^ Y) 2 J ~l + o(l) since EY^oo 

— L_L_ i + (i) 

(l + /(n))E^EW, 

" E^w, 1 + 0(1) 

= /(n) + o(l) = (l) 

□ 

In order to prove Lemma O We use the "sprinkle" from t c — e to t c + e. 
We first run the Bohman- Frieze process up to time t c — e and divide the 
vertices into three parts. 

(1) Small : vertices in components of size < rri\e~ 2 

(2) Medium: vertices in components of size between m\e~ 2 and ni2e~ 2 

(3) Large: vertices in components of size > m,2e~ 2 . 

Call the respective subgraphs Gs, Gm and Gl on these vertices. The con- 
stants mi < ?7i2 will be determined later in the proof. 

Proposition 2. Whp at time t c — e the above sets of vertices satisfy: 

(1) \G M \ ~ c M en 

(2) \G L \< c L en 

where cm and cl are constants depending on mi and m^. 

_2 

Proof. Using Theorem[3]we compute \Gm\ ~ X^m 1(: -2 ( x «(^c — e )+°(l)) n ~ 

£™ 2 ^L 2 ((c + 0(e))i- 3 / 2 e-( d+ °^ t2i + o(l))n ~ c M en for a constant c M 
depending on mi and 7712- For \Gl\, we know from the concentration of 
susceptibility that for some constant c, at t c — e, J2 V \^-( V )\ — T whp, 
so the number of vertices in components larger than m2t~ 2 is less than 
jLm = ^L en . □ 

m 2 e m 2 

Next we will run the Bohman- Frieze process from time t c — e to t c + e and 
show that whp for an appropriately chosen constant Ki, two separate con- 
nected components of size > K[e~ 2 logn form in the Gm subgraph without 
joining any vertices in Gl- If the giant component at t c + e contains vertices 
from Gl, then either of these two components serve as the lower bound for 
the second-largest component. If not, the smaller of the two serves as the 
lower bound, since the two components are not joined to each other. 
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We run the Bohman-Frieze process as follows. Call a pair of potential 
edges (ej, /j) green if e« does not connect two isolated vertices at time t c — e 
and fi connects two medium components. Call all other pairs of edges brown. 
A randomly selected pair of edges will have probability q = (1 — xf(t c — 
e)) • c^e 2 + o(l) of being green. Assign a uniform [0, 1] random variable 
independently to each pair of edges (ordered, with repetition, brown and 
green pairs). At each step of the process we flip an independent coin. With 
probability q we pick a random green pair of edges and add its second edge 
to the graph. With probability (1 — q), pick the brown pair of edges with 
the lowest random number assigned, add the appropriate edge according to 
the Bohman-Frieze rule, and discard the pair. Since the edges added from 
the green pairs do not touch isolated vertices, their choice will not affect the 
choice of edge from the brown pairs. Similarly, the choice of brown edges 
will not affect the choice of green edges. 

Now we describe a random graph, the lower bound graph, coupled to the 

Bohman-Frieze process. We add the second edge of the same green pairs as 

above (a random choice of Bm(m,q) medium-medium edges), calling these 

edges green edges, then add both edges from all pairs (brown and green) 

whose random variable is < -Mn, calling these edges brown edges. 

I2J 

The following facts hold about this coupling: 

1. The green edges in the lower bound graph are the same edges as the 
Bohman-Frieze process adds from green pairs; the edges from brown 
pairs in the Bohman-Frieze process are a subset of the brown edges in 
the lower bound graph. 

2. Every edge in the graph is added independently with probability p^ ~ 
Y as a brown edge in the lower bound graph. These edges are inde- 
pendent of the green edges added. 

3. If a component of size > iT/e~ 2 logn forms from green edges in Gm 
which is not connected to any large vertex in the lower bound graph, 
then there is also such a component in the Bohman-Frieze process. 

Now we analyze the lower bound random graph. Using a standard trans- 
formation we assume that each potential green edge is added independently 
with probability p g = ^- . 

Proposition 3. There exist constants K\ and K m so that whp more than 
n o.s components of size between K[,e~ 2 logn and K m e~ 2 \ogn form in Gm- 

Proof. Call the connected components in Gm at time t c — e 'nodes'. We 
show that whp more than n ' 8 trees of k = log n nodes form in the lower 
bound graph with the addition of the random green edges. Let X be the 
number of such trees. From Proposition [2] we know that the number of nodes 
is between 

c M e 3 n 

^1 = , 

m 2 

_ c M e 3 n 

mi 



15 



The probability that two nodes join together with the addition of the random 
green edges depends on the size of the nodes and is between: 



Pi 

Pu 



ije- 3 



n 
n 



So using the exact formula for the number of trees on k 
we calculate 



logn nodes, 



EX > 



Nfk k - 2 (c g mje- 3 ) k - 1 
k\n k ~ 1 

{c g m\m2C M ) k e 3 



exp 



c„rate 



2^-3 



n 



-N u k 



~ n 



~ n ■ 



c g ml^/27rk 3 / 2 



exp 



c„m5 



mi 



-=^, W2 CM .exp 



777-1 



Since each node has size between 



2 and ni2e 2 , setting K m = finishes the proof. 



which, for K[ small enough, is > n 0,8 . 

We now use Chebyshev's inequality via Lemma H] to show concentration. 
Let Tj be the indicator random variable that a set i of k nodes is in fact an 
isolated tree. Then the number of isolated trees of size k is ^ Tj. If trees 
i and j overlap, then cov(Ti,Tj) is negative since the two sets of vertices 
cannot simultaneously be isolated trees. If the two trees do not overlap then 

E(T^T j )<ET i ET j (l- Pu )- k2 . 

bmce (1 -p u )~ k - 1 = 0(n -1 log 2 n) , we use Lemma H] to conclude that 
EiTi-E^Ti whp. 

□ 

Call the above components 'good' components. Now we show that whp 
at least two good components avoid Gl and the rest of Gm with the addition 
of the brown edges. Fix a good component, call it Y. First we bound the 
number of small vertices that are connected to Gm arid Gl, excluding Y, 
with the addition of the random brown edges. 

Proposition 4. For some constant Cb s , whp the number of small vertices 
connected to Gm or Gl at time t c + e is < Cb s en. 

Proof. Let F = Gm U Gl \ Y. \F\ < (cl + Cjvf)en =: CFtn. Starting 
with the vertices in F we run a breadth-first search to explore the small 
vertices connected to F . Using Theorem [3j and approximating the sum 
Y^i^i i x i(tc — e) by an integral, we see that the average component size 
of small vertices is < Cv ^~ whp. Each brown edge is added independently 
with probability so we can bound above the size of the F-connected set 
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of small vertices with a branching process in which each vertex has Pois(o,e) 
descendant nodes, each of which has a number of vertices distributed as the 
size of a randomly chosen small node; in particular, the mean number of 
descendants in one node is < Cy ^" , If Z$ is a random variable representing 
the total number of descendants of one vertex, then EZj = ^/miccb < 0.9 
for a small enough choice of mi. For the branching process to reach size 
cpen + ten, we must have 

CFen+ten 

Z{ > ten 

i=l 

Since EZ,; < 0.9, for t > 10cp, a Chernoff bounds shows that the probability 
of this occurring is exponentially small. Letting q> s = YLcp completes the 
proof. □ 

Proposition 5. The 'probability that component Y does not join any vertex 
in Gl with the addition of the brown edges is > n~ . 

Proof. Y has size between -fQe -2 logra and Ki^e~ 2 logn. The vertices that 
are potentially dangerous - vertices that could join Y to Gl - are the large 
vertices, the other medium vertices and the small vertices which are con- 
nected to either of the two sets. Whp the total number of these dangerous 
vertices is < (q, s + cl + cm) en. 

The probability that Y avoids all the dangerous vertices with the addi- 
tion of the random brown edges is therefor 



> ^ 0,6^ ( c bs+CL+c M )tn-\Y\ 

> ^ Q,e^ {c bs +c L +c M )enKi^<i- 2 logn. 
^ n -Cb(c b s+c L +c M )Ki^ > n -0.1_ 



for K[ a small enough constant. □ 

Proposition 6. Whp at least two good components survives the brown edges 
without connecting to Gl or the rest of Gm ■ 

Proof. The expected number of good components that survive the brown 
edges is > n 0,7 . We bound the variance. Fix two good components, Y and 
Z. Let Xy and Xz be the indicator random variables that the respective 
components survive the brown edges. From Lemma UJ it is enough to show 
that E[Xy \Xz = 1] = EXy (l+o(l)). Conditioning on Z surviving, the num- 

2 

ber of edges that cannot be present changes by \Z\ ■ \ Y\ < Kf^e~^ log 2 n, 

and each factor is (1 — ©(^)), for a total change of (1 + o(l)) in the proba- 
bility. □ 

So whp at least two of the components larger than i<Qe~ 2 logn in Gm 
avoid Gl U Gm and all vertices in Gs connected to them. This provides the 
lower bound for the second-largest component. 
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5.3 Structure of Small Components 



We prove the second part of Theorem [2] that at t c + e, all components apart 
from the giant component are simple. Define the L-restricted susceptibility 
of G to be 

Sl{G) = - V |Q| 2 . 
n 

\d\<L 

Note that Sl{G) < L. At step j the probability that an edge is added within 
a component of size < L is 

1 Xiti? \ y |Q| 2 / X 1 (j)^ S L U) < L 

|Ci|<£ 



n" 1 £ — ' n' \ n 2 J n n 



The expected number of cyclic components of size < L created before time 
t c + e is < (*c + e)§^ < and so whp < L log n such components are created. 
The probability a complex component of size < L is created at step j is 

/ X 1 (j) 2 \ ICJICJ L 2 „. , 

v 7 C,,C fc cydic,[Ci[,[C fc [<i 

where Z{j) is the number of cyclic components of size < L at step j. By 
the above Z(j) < Llogn for all j whp, so the expected number of complex 
components of size < L created by time t c + e is < L ^° sn - If we set 
L = K u e~ 2 logn, this is o(l), and so whp there are no complex components 
of size < Ke~ 2 logn at t c + e, and therefore no complex components other 
than the giant component. 



6 Vertices in Small Components 

Before proving Theorem [31 we give in Section 16.11 a standard proof of the 
corresponding Erdos-Renyi result for comparison and another proof of it to 
illustrate a method based on a quasi-linear PDE and singularity analysis, 
which we will use in Section T6.2I to prove Theorem [3l 



6.1 The Erdos-Renyi Process 

If e is fixed, and either positive or negative, we have the following exact 
formula (see e.g. Chapter 11 of pQ): 

Xi (t c + e) = lim Pt[\K(v)\ = i] = U , 1 ) ■ (16) 

Now using Stirling's formula we compute asymptotics in i: 

Xi (t c + e)= l-J-i-We-t-W+M (1 + 0(1/*)). 
V 27T i + e 

As log(l + e) = e - e 2 /2 + 0(e 3 ), we have 

Xl (t c + C) = i±£Mz-3/2 e -(# + 0(e3)), (1 + 0(1/<))j 
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so 

Xi (t c + e ) = C(e)r 3 / 2 e~ d ^ 2i (1 + 0(l/i)), (17) 
where C(e) = + 0(e) and d(e) = ± + 0(e). 

Now we prove the same fact as above, but using a different method, 
the one we will later use to prove Theorem [3j In Section 16. 2\ we write the 
differential equations for the functions Xi(t) for the Bohman- Frieze process, 
describing the proportion of vertices in components of size i. For the Erdos- 
Flenyi process these equations are: 

zj(t) = -ixiit) + ^Y1 x k(t) x i-k(t) (18) 

k<i 

for i > 1, with initial conditions x\(0) = 1 and rcj(O) = for i > 2. We can 
define a function of two variables, 

P(t,z) =^2x i (t)z i . (19) 

i>l 

Our goal is to extract the asymptotic behavior of the coefficient xi{t) of 
P(t, z), using singularity analysis. 

1 8[P(t,z) 2 ] _ 



Multiplying by z % both sides of (|18p and summing over i, since | M 
35 ' 



P(M) dP »f^ i we fi n d a homogeneous quasi-linear PDE for the Erdos-Renyi 



process: 

dP(t,z) , . . ,dP(t,z) . , 

with an initial condition P(0, z) = z. Using the method of characteristic 
curves (see for example, Section 3.2 of [2]), we note that a solution to (f20j) 
defines a surface y = P(t, z) in (t, z, y)-space, and we obtain ODE's: 

dz , lX dy ,„_ s 

^ = -^- l) 'i = ° (2l) 

with initial values z(0) = zq and y(0) = yo, with yo = ^0 from the initial 
condition of the PDE. We can solve these ODE's to get: 

z{t) = ye^ (22) 

with y = yo = zq fixed. We write: 

z = F{t, y) with F(t, y) = ye*"*. (23) 

We now define a critical point with respect to 



t cr = inf t : 



9J%y) 
dy 



(t,l) = 0}. (24) 



Note that for all t, (t, l, l) satisfies (|22j) . Let us give some intuition for (|24p . 
We know that P(t, l) = ^Xj(t) amounts to the proportion of vertices in 
"small" components and that P(t, l) = l for t < l and P(£, l) < l for f > l. 
Furthermore, these "small" component are typically trees. Thus, loosely 
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speaking, for fixed t, y = P(t, z) would be equal to the generating function 
for vertex-rooted trees. From the generating function theory for trees (see 
e-g- [TT] ) , generating functions for various rooted trees exhibit a so-called 
square-root type singularity, which means that there is a value r where the 
first derivative of the inverse function of y vanishes. Thus we define t cr as 
the smallest t where the first derivative of the inverse function of y vanishes 
as z approaches 1. 

For fixed t, we can write z as a function of y, z = F(y) = F(t, y). If we 
invert and write y as a function of z, we find a singularity where ^ = 0. Let 

r be the y so that ^~"( r ) = 0, and p = F(t). We find the Taylor expansion 
of F(y) around r. 

z = p + F'(r)(y - r) + \f"{t)( V - r) 2 + 0((y - r) 3 ). (25) 

We rewrite and solve for y, using the fact that F'(t) = and y increases 
along the real z-axis (since the coefficients of y are non- negative) : 



In summary, for each fixed t, the function y = P(t,z) has the singular 
expansion around p(t) of the form 



P(t,z)=r(t)-J-^§-Jl--^ + ()[\ 



, F"(r(t))]j p(t) V P(t 

Now using the transfer theorem (Theorem VI. 3 [TB]) and the basic scaling 
(Chapter 6, [TT] ) , we extract the asymptotic behavior of the coefficient Xi(t) 
of P(t,z): 

x,(t) = ^lr 3 / 2 7 (tr(i + o(iA)), (26) 



where 



c(t) 
7(*) 



F"(r(t)Y 



p(ty 

Now we analyze c(t) and j(t). Using (|23p we can calculate 

^ = - tye^ (27) 
dy 

and 

J2 El 

1£ = ^e*-*!/ _ 2te*-^. (28) 
dy 2 

Setting ^ to gives r(t) = \ and 

p(t) = ie*- 1 . (29) 
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From the definition of t cr , we see that t cr = 1. Since p(l) = 1, p'(l) = and 
p"(l) > 0, we can write 



and this gives 



and 



p(l + e) = 1 + ^6 2 + 0(e 3 ) (30) 



7(1 + 6) = l-I e 2 + o (e 3) (si) 



7 (l + e )i = e iln 7 = e -d(e)6 2 i ^ 
where d(e) = —-^ ln(7(l + e) = \ + O(e). For c(t) we have: 



c(l + e) 



-2(1 + i«* + 0(e>)) 



;i + e)e e -2(l + e)e e 
'2(l + 0(e)). 

This recovers (|17[) . 

6.2 The Bohman-Prieze Process 

Ordinary Differential Equations. We describe how the differential equa- 
tions for the functions Xi(t) for the Bohman- Frieze Process are derived. The 

x (t—^) 

proof of convergence — 2 - = Xi(t) + o(l) can be found in [10J 



Begin with X\, the number of isolated vertices in the graph. The ex- 
pected change in X\ at step j is (neglecting loops and repeated edges): 

EA;fl(i) = _MlW 2 A_W^>. (33 ) 
n A \ n A ) n 

Each edge added advances time by — , so we can approximate the derivative 
of x\(t) by dividing the expected change of ^ by ^: 

x' l {t) = -x 1 (t)-xl(t) + x\{t). (34) 

By a similar calculation of the expected change in one round, we can write 
the differential equation for X2(t) and for general Xi(t), i > 2: 



x' 2 (t) = 2x\{t) - x\{t) - 2(1 - x\{t))x 2 {t) (35) 
i 

2 { 



x[(t) = 1(1 - x\{t)) ^ x k {t)x^ k {t) - i(l - x?(t))xi(i). (36) 



Proposition 7. Let t c be the critical point of the Bohman-Frieze process. 
The following hold concerning the functions Xi(t), i > 1: 

(i) For t < t c , 

oo 

5>;(i) = l. 

8=1 
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(ii) For t > t c , 

oo 

5>i(t) < 1- 

1=1 

Proof, (i) Fix e > 0. From the concentration of susceptibility and the asymp- 
totics of s\{t) [M], we know that for t = t c — e, 

- E \ C ( V )\ = S ^- n / 2 ) = s iW + (!) ^ -• 
n e 

for some constant K independent of e. 

Thus for any M there can be at most vertices in components of size 
> Me~ 2 . Choosing M large enough, we see that 

E x*(lc-e) > 1-^ + 0(1)- 
i=i 

Now let M — > oo to get the result. 

(ii) Janson and Spencer [2T] show that at t c + e there is a component 
of size > yen for some constant y. The vertices in this component are not 
counted towards any of the Xj(i)'s, so 

oo 

^Xiitc + e) < 1 - ye 
i=i 

□ 

Remark: In fact, Riordan and Warnke [33] have recently showed more: 
that X^ x «(^c) = 1 and that for t > t c , J2i x i(t) = 1 ~~ Vi where the size of 
the giant at time t is ~ yn. 



Quasi-linear Partial Differential Equation. We again consider the 
generating function 

oo 

P{t,z) = Y J X i {t)z\ (37) 



We multiply by z % the both side of ([36]) and sum over i. Since - dP ^' z ^ 



i=i 

J and sum over i. Since ^ — q^t 
P(t, z) 9P q^ , using also (|3^|) . ([35 ]) we obtain a non-homogeneous quasi-linear 
PDE for the Bohman-Frieze process: 

- z(l - x?(t))(P(t, *) - 1)^^ = *(* " l)x?(t). (38) 

Again we have the initial condition 

P(0,z) = z. (39) 

If we let f(t, x, P) = -z(l - zf (t))(P(t, z) - 1) and g(t, z) = z(z - l)x\{t), 
then we can rewrite (1381) as 



dP(t,z) ^dP(t,z) . , 

^' 1 +f{t,x,P) y z J =g{t,z). (40) 
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A solution to this quasi-linear PDE defines an integral surface y = P(t,z) 
in the (i, z, y)-space and we use the method of characteristics: 

dt dz dy 



f(t,z,y) g(t,z) 
or, 

§ = /(*,.,,),§ = *,.) (41) 

with initial values zq and yo satisfying zq = yo due to ([39]) . We solve the 
ODE's in (HQ: 

z(t) = z exp (- j\l - x\{s)){y{s) - 1) te\ (42) 
v(t)=yo+ [ z(s)(z{s)-l)xl(s)ds. (43) 
Since zq = yo we write: 



o 



y(t) = z(t) exp U (1 - xf(a))(y(s) - 1) rfsj + J z(s)(z(s) - l)xi(s) ds. 

(44) 

Let 

G(t,z,y) = -y+zexp (J (a))(y(s)-l)da)+jf «(a)(«(a)-l)x?(s)da. 

Equation (|44|) implicitly defines y as a function of i and z by 

G(t,z,y)=0. (45) 

Notice that for all t, y = z = 1 is a solution corresponding to the initial 
condition zq = yo = 1. We define 

£ cr = inf |t : 3 a solution to G(i, 2, y) = : 2 > 1, y < l|. (46) 

Proposition [7] shows that this is well-defined and that t cr = t c , where t c is 
the critical point of the Bohman-Frieze process. 



Singularity Analysis. For a fixed t, we let G(t, z, y) = G(z, y) and let 
F(z,y) = zex P (^(l-xl( S ))(y(s)-l)ds)+^ z(s)(z(s)-l)xl(s)ds, (47) 
so that 

G(z,y) = -y + F{z,y). 
Note that G(z, y) = defines implicitly y as a function of z. 
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Below we use the notation G z = ^ , G y = ^ , G yy = ,F Z = Q^ and 
F yy = j^j. We let (p,r) be the solution to 

G(p,r) = 
G y (p,r) = 0. 

Then, the singular implicit functions theorem (see e.g. Lemma VII. 3. in 
[17] ) says that for any 6 > 0, there exists a neighborhood Dq(p) of p such 
that at every point z £ Dg(p) — Rg(p), (where Re{p) = {z : z = p + s(cos9 + 
i sin 8) , s > 0} the ray of angle 8 emanating from p) the equation G(z,y) =0 
admits two analytic solutions yi(z) and 2/2(2) that satisfy 




//2U) - , v ;i - -) J/2 + r;(i : 



However, since the coefficients of y = y{z) (in the powers of z) are all non- 
negative, y{z) is increasing along the real z-axis and therefore, y\{z) is the 
right solution that we are looking for. For notational convenience let us use 
y(z) instead of y\{z). Since G z (z,y) = F z (z,y) and G yy (z,y) = F yy (z,y), 
we have 



V F vviPi T ) V P> V P 

In Proposition [8] we show that 2 p Fz (p^) ^ ^- 

In summary, for each fixed t, the function y = P(t, z) has the singular 
expansion around pit) of the form 



Pitz) = Tit) - J W)FMt)Mt)) L _ M 1/2 + ( 1 _ jl 
[, ) {) V F «>W)rtt)) V Pit)) + V Pit) 

Now we use the transfer theorem (Theorem VI. 3 [16J) and the basic scaling 
(Chapter 6, [IT]) to extract the asymptotic formula of the coefficient X{it) 
of P(t,z): 

x i (t) = ^=r a ^(t) i (i + o(i/i)) t (48) 

where 



' V f »»w*). t W) 

7«) = ^- (50) 
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Existence and Properties of p. In order to simplify notation in the 
proof we now make some definitions: 

u(t,y) = exp^ (1 -xj(s))(y(s) - l)dsj, 

v(t,z) = / z(s)(z(s) — l)x\ (s)ds, 
Jo 

q(t,z) = / (2z(s) - l)xl(s)ds, 
Jo 

1 - s?(t) 

a(t) = ~4(f) ' 

P{t) = [ (l-xl(s))ds. 
Jo 

Given a fixed t, we let 

u(y) = u(t,y), v(z)=v(t,z), q(z) = q(t,z), a = a(t), j3 = (5{t). 
We rewrite ([37]) as 

F(z,y) = zu{y)+v{z) (51) 
and differentiate it with respect to y: 

F y (z,y) = zu(y)^j (l-x\{s)){y(s)-l)ds 

= zu(y) / (1 - xl(s))ds 
Jo 

= Pzu{y). (52) 

Therefore, a solution (p, t) to G(z,y) = and G y (z,y) = (equivalently 
y = F(z,y) and F y (z,y) = 1) must satisfy 

y = F(z,y) = zu(y) +v(z), l = F y (z,y) = f3zu(y). 

That is, 

1 1 / \ 

We let p be the smallest positive real- valued solution of the function y = y{z) 
that satisfies the equation 

zpu(l/P + v(z)) = 1. 

Pringsheim's theorem (Theorem IV. 6 in (T7J) guarantees the existence of 
such a solution, since the coefficients of y = y{z) (in the powers of z] are all 
non- negative. 

Below in Propositions El M and QUI we prove that > 0, p'(t c ) = 0, 

and p"(t c ) > 0. 
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Proposition 8. 

Fz(p,r) 



> 



F yy(p, T ) 

Proof. Differentiating ()5ip with respect to z we have 



d 



u(y) + ^- z [fz(s)(z(s)-l)xl(s)ds) 



dz\J 
= u(y) + q(z), 

and in particular, we have F z (p,r) = u(t) + q(p). 
We differentiate (j52[) with respect to y: 
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and we have F yy {p,r) = (3 pu(r). 

Since u(r),q(p), (3, p > 0, we have F z (p,r) > and F yy (p,r) > 0. □ 

Proposition 9. 

p'(i c ) = 

Proof. By G{(t,z,y) we mean the derivative of G(t,z,y) in (t,z,y) with 
respect to the i-th variable. 

We differentiate G(t,z,y) = G(t, z(t),y(t)) with respect to t: 

^G(t,z(t),y(t)) = G 1 (t,z,y) + G 2 (t,z,y)z'{t) + G 3 (t,z,y)y'(t), 
where 

= + (53) 

G 2 (t,z,y) = u(t,y) + q(t,z) 

G 3 (t,z,y) = -1 + P(t)zu(t,y). (54) 
As p(t),T(t) satisfies = G(t, p{t),r(t)), given a fixed t, we have at 

(p,T) = ( P (t),T(t)), 

= G 1 (t,p,T) + G 2 (t,p,T)p'(t) + G 3 (t,p,T)r'(t). (55) 

But 

d^r) = P u(t)(1 - x\(t))(r ~ I) + P(P ~ l)x\{t) 
G 2 (t,p,T) = n(r) + g(r)>0 
G 3 (t,p,r) = -1 + #w(t) = 0, 

and so, from (|55p 

_ Gi(t,p,r) _ pu(r)(l - xj(t))(T - 1) + p(p - l)gj(t) 
PU " G 2 (t,p,T)~ u(T) + q{r) 

Since p(t c ) = r(i c ) = 1, we have 

p'(t c ) = 0. 

□ 
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Proposition 10. 

p"(t c ) > 

Proof. By Gij(t,z,y) we mean the derivative of Gi(t,z,y) in (t,z,y) with 
respect to the j-th variable. 

We differentiate G(t,z,y) = G(t, z(t),y(t)) twice with respect to t: 

d 2 

= - (Gxit, z, y) + G 2 (t, z, y)z'(t) + G 3 (t, z, y)y'(t)) 
= G u (t, z, y) + G 12 (t, z, y)z'(t) + G 13 (t, z, y)y'{t) 

+ {G 21 (t, z, y) + G 22 (t, z, y)z'(t) + G 23 (i, z, y)y\t)} z'{t) + G 2 (t, z, y)z"(t) 

+ (G 31 (t, z, y) + G 32 (t, z, y)z'{t) + G 33 (t, z, y)y' '(i)) y' \t) + G 3 (t, z, y)y"(t). 
Given a fixed t, we have, at (p, r), 

= G 11 (t,p,T) + G 12 (t,p,T)p'(t) + G 13 (t,p,T)T'(t) 

+ (G 21 (t, p, t) + G 22 (t, p, T)p'(t) + G 23 (t, p, r)r'(i)) p'(t) + G 2 (t, p, r)p"(t) 
+ (Gsi (t, p, t) + G 32 (i, p, T)p'{t) + G 33 (t, p, r)r'(t)) r'(t) + G 3 (i, p, r)r"(t). 

Since G 3 (t,p,r) = 0,G 2 (i,p, r) > 0, we have 

G u (t, p, r) + G 12 (t, p, r)^ (i) + G i3 (t, P, r)r'(t) 



G 2 (t,p,r) 

G 2 i(t, p, r) + G 22 (i, p, r)p'(t) + G 23 (i, p, r)r'(i) )p'(t) 



G 2 (t,p,r) 

(G 31 (t, p, r) + G 32 {t, p, r)p'(i) + G 33 {t, p, r)r / (t))r / (t) 



G 2 (t,p,r) 

Since p(i c ) = r (*c) = 1 and p'(t c ) = 0, 
Gn (t c , 1,1) 



P"(tc) 



G 2 (t c ,l,l) 

G 13 (t c , 1, 1) + G 3 i(t c , 1, 1) + G 33 (t c , 1, l)r'(t c ) )r'(t c ) 



G 2 (t c ,l,l) 

(56) 

We differentiate Gi(t,z,y) in ([53]) with respect to the first variable: 

G u (t,z,y) = -2zx 1 (t)x' 1 (t){y-l)u(t,y) + z(l-x 2 1 (t)) 2 (y-l) 2 u{t,y) 
+2z(z - f)xi(t)xi(t). 

Thus, given a fixed t, we have, at (p, r), 

G n (t,p,T) = -2pxi(t)x' 1 (t)(r-l)n(r) 

+p(l - x 2 (t)) 2 (r - 1) 2 u(t) + 2p(p - l)xi(t)x[(t), 
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and in particular, since p(t c ) = r(t c ) = 1, we have 

Gu(t c ,l,l) = 0. (57) 
We differentiate G 3 (t,z,y) in (j5^j) with respect to the first variable: 
d 

—G 3 (t,z,y) = G 31 (t,z,y) + G 32 (t,z,y)z'(t) +G 33 (t,z,y)y'(t), 
where 

G 31 (t,z,y) = f3'(t)zu(t,y) + P(t)zu(t,y)(l-xi(t))(y-l) 
G 32 {t,z,y) = P(t)u(t,y) 
G 33 {t,z,y) = (3(t) 2 zu{t,y). 

Thus, given a fixed t, we have, at (p, t), 

= G 31 (t, P ,T)+G 32 (t,p,T)p , (t) + G 33 (t,p,T)T , (t). 

and 

G 31 (t,p,r) = p\t)pu{T)+ppu{T){l-xl(t))(r-l) 

G 32 {t,p,T) = (3u(t)>0 

G 33 (t,p,r) = P 2 P u(t)>0, 

so 

l( s G 31 (t,p,r)+G 32 (t,p,T)p'(t) 
T[) G 33 (t,p,r) 

In particular, since p'(t c ) = 0, we have 

Jt+ \ G 3 i(t c , 1, 1) 

T(tc) = -Gs^f-M)" (58) 
From (J5BJ, ([SZD and H5SJ) we get 

Gi 3 (t c ,l,l)G 3 i(t c ,l,l) 



P"(tc) 



G 2 (t c ,l,l)G 33 (t c ,l,l) 



Note that G 2 (t c ,l,l) = «(1) + 9 (1) > 0, G 31 (i c ,l,l) = f3'(t c )u(t c ,l) > 
and G 33 (i c , 1,1) = /3 2 (t c )«(t c , 1) > 0. Finally, we differentiate G\(t,z,y) in 
(|53p with respect to the third variable: 



G 13 (t,z,y) = z(l-x 2 1 (t))u(t,y)+(3(t)z(l-x 2 1 (t))(y-l)u(t,y). 
Thus, given a fixed t, we have, at (p,r), 

G 13 (t,p,T) = p(l-X 2 {t))u{T)+P P {l-X 2 {t)){T-l)u(T) 

and so, 

Gi 3 (i c ,l,l) = (l-xl{t c ))u{t c ,l) >0. 
Therefore, we have p"(t c ) > 0. 



□ 
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Proof of Theorem [3]. From Equation (jH|), and Propositions l9l and \W\ 
c(t c + e) = c(t c ) + 0(e) 
P(tc + e) = l + ^6 2 + 0( e 3 ). 

If we let d = p"(t c )/2 > (see Proposition [TU|) . then since ^{t) = l/p(t) we 
have 

7(t c + e) = l-de 2 + 0(e 3 ). 
Let c = j^F=- Equation (jiS]) gives us: 

Xf (t c + e ) = C(e)r 3 / 2 e- D ^ 2t (1 + O {!/%)) (59) 
where C(e) = c + 0(e) and -D(e) = d + O(e). This gives Theorem [31 
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